NSF PAR Search | NSF Public Access Repository

Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation

Yang, Chenxi; Anderson, Greg; Chaudhuri, Swarat (May 2024, IEEE Conference on Secure and Trustworthy Machine Learning (SaTML))

Full Text Available
Guiding Safe Exploration with Weakest Preconditions

Anderson, Greg; Chaudhuri, Swarat; Dillig, Isil (January 2023, International Conference on Learning Representations (ICLR))

In reinforcement learning for safety-critical settings, it is often desirable for the agent to obey safety constraints at all points in time, including during training. We present a novel neurosymbolic approach called SPICE to solve this safe exploration problem. SPICE uses an online shielding layer based on symbolic weakest preconditions to achieve a more precise safety analysis than existing tools without unduly impacting the training process. We evaluate the approach on a suite of continuous control benchmarks and show that it can achieve comparable performance to existing safe learning techniques while incurring fewer safety violations. Additionally, we present theoretical results showing that SPICE converges to the optimal safe policy under reasonable assumptions.
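The shielding idea in this abstract can be illustrated with a minimal sketch. It is not SPICE's implementation: it assumes a known linear model x' = A x + B u + c and a single half-space safety constraint p · x' <= b, and the function names `weakest_precondition` and `shield` are hypothetical. Under those assumptions, the weakest precondition of the constraint with respect to the action is itself a linear constraint g · u <= h, and an unsafe proposed action can be projected onto it.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def weakest_precondition(
    A: Matrix, B: Matrix, c: Vector, p: Vector, b: float, x: Vector
) -> Tuple[Vector, float]:
    """Return (g, h) such that g . u <= h  iff  p . (A x + B u + c) <= b.

    Assumes the (hypothetical) linear model x' = A x + B u + c.
    """
    # p . (B u) equals (B^T p) . u, so g is B^T p.
    g = [dot(p, [row[j] for row in B]) for j in range(len(B[0]))]
    # Part of the next state that does not depend on the action.
    drift = [dot(row, x) + ci for row, ci in zip(A, c)]
    h = b - dot(p, drift)
    return g, h


def shield(u: Vector, g: Vector, h: float) -> Vector:
    """Return u if it satisfies g . u <= h, else its Euclidean projection
    onto that half-space (the closest action the model deems safe)."""
    violation = dot(g, u) - h
    if violation <= 0:
        return list(u)
    norm_sq = dot(g, g)
    if norm_sq == 0:
        raise ValueError("the action cannot influence the constraint")
    return [ui - violation * gi / norm_sq for ui, gi in zip(u, g)]


# One-dimensional example: x' = x + u, safety constraint x' <= 1, x = 0.5.
g, h = weakest_precondition([[1.0]], [[1.0]], [0.0], [1.0], 1.0, [0.5])
safe_action = shield([2.0], g, h)  # the proposed action 2.0 is clipped
```

A projection is only one possible way to choose a safe action; the sketch ignores model error, multiple constraints, and multi-step horizons.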
Full Text Available
Neurosymbolic Reinforcement Learning with Formally Verified Exploration

Anderson, Greg; Verma, Abhinav; Dillig, Isil; Chaudhuri, Swarat (January 2021, 34th Conference on Neural Information Processing Systems)

Full Text Available
Neurosymbolic Reinforcement Learning with Formally Verified Exploration

Anderson, Greg; Verma, Abhinav; Dillig, Isil; Chaudhuri, Swarat (October 2020, Neural Information Processing Systems)
We present Revel, a partially neural reinforcement learning (RL) framework for provably safe exploration in continuous state and action spaces. A key challenge for provably safe deep RL is that repeatedly verifying neural networks within a learning loop is computationally infeasible. We address this challenge using two policy classes: a general, neurosymbolic class with approximate gradients and a more restricted class of symbolic policies that allows efficient verification. Our learning algorithm is a mirror descent over policies: in each iteration, it safely lifts a symbolic policy into the neurosymbolic space, performs safe gradient updates to the resulting policy, and projects the updated policy into the safe symbolic subset, all without requiring explicit verification of neural networks. Our empirical results show that Revel enforces safe exploration in many scenarios in which Constrained Policy Optimization does not, and that it can discover policies that outperform those learned through prior approaches to verified exploration.
Full Text Available
Neurosymbolic Reinforcement Learning with Formally Verified Exploration

Anderson, Greg; Verma, Abhinav; Dillig, Isil; Chaudhuri, Swarat (January 2020, Advances in neural information processing systems)
Full Text Available
Optimization and abstraction: a synergistic approach for analyzing neural network robustness

https://doi.org/10.1145/3314221.3314614

Anderson, Greg; Pailoor, Shankara; Dillig, Isil; Chaudhuri, Swarat (July 2019, Symposium on Programming Languages Design and Implementation (PLDI), 2019)

In recent years, the notion of local robustness (or robustness for short) has emerged as a desirable property of deep neural networks. Intuitively, robustness means that small perturbations to an input do not cause the network to misclassify it. In this paper, we present a novel algorithm for verifying robustness properties of neural networks. Our method synergistically combines gradient-based optimization methods for counterexample search with abstraction-based proof search to obtain a sound and (δ-)complete decision procedure. Our method also employs a data-driven approach to learn a verification policy that guides abstract interpretation during proof search. We have implemented the proposed approach in a tool called Charon and experimentally evaluated it on hundreds of benchmarks. Our experiments show that the proposed approach significantly outperforms three state-of-the-art tools, namely AI², Reluplex, and ReluVal.
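The abstraction-based half of such a robustness check can be sketched with the simplest abstract domain, intervals. This is not Charon's algorithm: it omits the gradient-based counterexample search, the refinement loop, and the learned verification policy, and the names `affine`, `relu`, and `certify` are hypothetical. It only shows how an L∞ ball around an input is pushed through a small ReLU network to try to prove that the predicted label cannot change.

```python
from typing import List, Tuple

Interval = Tuple[float, float]
Layer = Tuple[List[List[float]], List[float]]  # (weights, biases)


def affine(W: List[List[float]], bias: List[float],
           box: List[Interval]) -> List[Interval]:
    """Sound interval bounds for W x + bias when x ranges over box."""
    out = []
    for row, b in zip(W, bias):
        lo = hi = b
        for w, (l, h) in zip(row, box):
            # A negative weight swaps which endpoint gives the bound.
            if w >= 0:
                lo += w * l
                hi += w * h
            else:
                lo += w * h
                hi += w * l
        out.append((lo, hi))
    return out


def relu(box: List[Interval]) -> List[Interval]:
    return [(max(0.0, l), max(0.0, h)) for l, h in box]


def certify(layers: List[Layer], x: List[float],
            eps: float, label: int) -> bool:
    """Return True if interval analysis proves that every input within
    L-infinity distance eps of x is classified as label.

    False means "not proved", not "not robust": intervals are sound
    but imprecise.
    """
    box = [(xi - eps, xi + eps) for xi in x]
    for i, (W, b) in enumerate(layers):
        box = affine(W, b, box)
        if i < len(layers) - 1:  # no activation after the output layer
            box = relu(box)
    target_lo = box[label][0]
    return all(target_lo > hi
               for j, (_, hi) in enumerate(box) if j != label)


# Toy two-layer network, used only for illustration.
net = [([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]),
       ([[1.0, -1.0], [-1.0, 1.0]], [0.0, 0.0])]
proved_small = certify(net, [2.0, 0.0], 0.25, 0)
proved_large = certify(net, [2.0, 0.0], 1.5, 0)
```

When the interval proof fails, a complete procedure would either find a concrete counterexample or refine the abstraction, for example by splitting the input region; that step is left out here.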
Full Text Available